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(54) Polymorphism detection 

(57) The present invention generally provides a rap- 
id efficient method for analyzing polymorphic or biallelic 
markers, and arrays for carrying out these analyses. In 
general, the methods of the present invention employ 
arrays of oligonucleotide probes that are complementa- 
ry to target nucleic acids which correspond to the marker 



sequences of an individual. The probes are typically ar- 
ranged in detection blocks, each block being capable of 
discriminating the three genotypes for a given marker, 
e.g., the heterozygote or either of the two homozygotes. 
The method allows for rapid, automatable analysis of 
genetic linkage to even complex polygenic traits. 
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Description 

BACKGROUND OF THE INVENTION 

The relationship between structure and function of macromolecules is of fundamental importance in the ^under- 
standTnqS broial systems. These relationships are important to understanding, for example the functus o [ en- 
^merst^ural^proteL and signalling proteins, ways in which cells communicate with each other, as wel. as mech- 

T^",T^^ ,e Processes. Life is ^^^^^^ 

tl o rooert es of These polypeptides, e.g., as enzymes, functional proteins, and structural protons, are determ.ned by 
he seque ce o am no als which ma'ke them up. As structure and function are integrally related, many Mag* 
Jettons may be expla.ned by elucidating the underlying structural features which prov.de those functors. nd these 
Zmr ?.? es are determined by the underlying genetic information in the form of polynucleot.de sequences. Further, ,n 
SS^^SXSS^ polynucleotide sequences also can be invoived in control and regu.at™ of gene 

scientific and tfeatmem ot „ varjety of disorders may often be accomplished through ,den- 

mcZoZT^oZ^ of the genetic material which encodes for specific disease associated trarts In order o 
accomplish^ however, one must first identify a correction between a particular gene and a part.cular trart. Th ,s 
aeneXaccompt hed by providing a genetic linkage map through which one identifies a set of genetic markers that 
follow a partSTr raU These markers can identify the location of the gene encoding for that trart wrth.n the genome 
evlmuaS to the identification o, the gene. Once the gene is identified, methods of treating the d^order that 

is result from that gene, i.e., as a result of overexpression, constitutive expression, mutation, underexpress.on, etc., can 

69 Tnl dat orotic markers includes variants in the genet, code termed "polymorphisms.. In the course , o, 
evolSion he genome of a species can collect a number of variations in individua. bases. These s.ngle base changes 
are temTed s!ng!e base polymorphisms. Polymorphisms may also exist as stretches of repeating 
as to tneteng h 9 of the repeat from individual to individual. Where these variations are recumng, e.g., exist ,n a sign, cant 
pe centag o a populafen. they can be readily used as markers linked to genes involved in mono- and P^gen.cja^ 
m the human oenome single-base polymorphisms occur roughly once per 300 bp. Though many of these vanant 
La esappeart^TequS 

u To ^ those^rnnq in 20 to 50 % of the allele population) can be found approximately once per k.lobase. Acco ding ly, 
!r " ! humTn"eno7e 9 orapproximately 3 Gb, one would expect to find approximately 3,000,000 of these 'useful poly- 

^T^Tuse of polymorphisms as genetic .inkage markers is thus of critical .mportance in locating, identifying and 
rh^r^tenzTno h^Ss which are responsible for specific traits. In particular, such mapping techn.ques allow for the 
idenSbn ^genes respSble for a variety of disease or disorder-related traits which may be used .n the dagnos.s 
2?StS2lK^J^»-B disorders. Given the size of the human genome, as well as those of other marnmab. 
rwouldgeneS 

The present invention meets these and other needs. 
SUMMARY OF THE INVENTION 

The oresent .nvention generally provides arrays and methods useful in screening large numbers of polymorphic 
markers ZTo^TTnplnlcJ, the present invention provides arrays of oligonucleotide probes for detecting a 
Symo phismTn S n'ucleic acid sequence. The array general.y comprises a, least one de.ect.on bkx* of p^bes 
which includes fi.st and second groups of probes that are complementary to first and second vanants of the target 
nudeic acW secure respectively. The array further comprises third and fourth groups of probes hav.ng a sequence 
SSnd second groups, respective., but including all possib.e ^-substitutions of pos,t,ons , the 
sequence that are within n bases of a base corresponding to the polymorphs, where n .s from 0 to 5 

The present .nvention also provide methods of using the above arrays in screen.ng genom.c mater.a. for the pres- 
ence of the polymorphisms. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows a schematic illustration of light-directed synthesis of oligonucleotide arrays. 
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Fiqure 2A shows a schematic representation of a single oligonucleotide array contammg 78 separate detection 
b.ocks Figure 2B shows a schematic illustration of a detection block for a specific polymorphism denoted WI-587 
F otle 2B also shows the triplet layout of detection blocks for the polymorphism emp.oying 20-mer oligonucleotide 
probes SJ5 1 subs«tuls 7 P 10 and 13 bp from the 3' end of the probe. The probes present in the shaded portions 
of each detection block are shown adjacent to each detection block. T nr-rr mr 

Figure 3 illustrates a tiling strategy for a polymorphism denoted WI-567, and hav.ng the sequence 5 -TGCTGCCTT- 
GGTO^^CTSiram" A detection block specific for the WI-567 po.ymorphism is shown with t e probe 
series t£d Serein listed above. Predicted patterns for both homozygous forms and the heterozygous form are 



S F^re 4 shoTa schematic representation of a detection block specific for the po^morphism denoted WM 959 
having the sequence 5'-ACCAAAAATCAGTC[T/C]GGGTAACTGAGAGTG-3' wrth the ^™^™^£^ r 
brackets A fluorescent scan of hybridization of the heterozygous and both homozygous torms are shown ,n the center, 

?5 determines 'JlS^En^^ in DNA are heterozygote, homozygote with a first poiymorphic marker or 

F qu re 7 shows a probLray including probes with base substitutions at base positions within two base pos .ons 
of Theposaic-n of the polymorphic marker is denoted P 0 and which may have one of two 

°' ^^g^ThowsTS level flowchart of analyzing intensities to determine whether ^^TrZ^ 
are hXozygote, homozygote with a first polymorphic marker or homozygote with a second polymeric marker 

heterozyg ya an af tj)ed for detectjng 246 dlffer ent polymorphic markers, both sense 

and MM M dM polymorphism detection block is indicated by a number representing a specific, 
Z^ome6 ^or P ^ m . Figure 10B shows a fluorescent scan of the array following exposure to f.uorescent.y la- 
belled target sequence. 



30 DETAILED DESCRIPTION OF THE INVENTION 



I. General 

The oresent invention generally provides rapid and efficient methods for screening samples of genomic material 

3s ,or S^SS^^ "« ical| y desi 9" ed for car,yin9 out these analyses - ln particular ' ! f h pres T h THZ 

rites to the idTn ification and screening of single base polymorphisms in a sample. In general, he methods of the 
^^riSJSTernptoy arrays of oligonucleotide probes that are complementary to target nuc.eic acid sequence 
seamen s from an individual (e g., a human or other mammal) which target sequences .nclude spec*, c identified po- 
ZXms Tr-po lymorphic markers." The probes are typically arranged in detectbn blocks, each bkx* being capable 
S££g the three genotypes for a given marker, e.g., the heterozygote or either of the two homozygotes. The 
method allows for rapid automatable analysis of genetic linkage to even complex polygenic traits. 

rth";^^ a coLnaJn of P photo,ithogra P hic methods and soHc I phase ^ 

10f ^heTa^strategy for .ight directed synthesis of oligonucleotides on a VLSIPS™ Array is outlined in Figure 1 The 
surtie of a eubS or solki support, modified with photosensKive protecting groups (X) is H.um.nated through a 
ohoSrthoqraPhrmask yielding reactive hydroxyl groups in the illuminated regions. A selected nucleotide, typ.ca.ly 
£ S Z^Tl^^orLm^l^ deoxynucleoside (protected at the 5" hydroxy, with a f^-r^ive 
^otec °ng group), is then presented to the surface and coup.ing occurs at the sites that were exposed to ;''9ht Following 
caooina and oxidation the substrate is rinsed and the surface is illuminated through a second mask, to expose addi- 
tiona hyd^oups to coup.ing. A second se.ected nucleotide (e.g., ^-protected, 30-phosphoram,d,te-act,vated 
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the sequence of the oligonucleotides at each site is known. 
II identification of Polymorphisms 

fhe differences being indicate of potential polymorphism. Alternative array-based "^W*«>» 
mnrriin (i e one that occurs in 20 to 50% of the allele population) occurs approximately once per Ikb in a gwen 

as a useful marker for genetic linkage analysis. 
Ill Screening Polymorphisms 

mmmmmmm 
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detection block These monosubstituted probes have bases at and up to a certain number of bases in either direction 
J^hTpS^IZn. substituted with tha remaining nuclides (selected from A, T, G, C or UMVP^. *• ^bes 
nTt led detection block will include substitutions of the sequence positions up to and .nclud.ng those that are 5 bases 
corresponds to the polymorphism. Preferably, bases up to and including 
s bates Ln the polymorphism will be substituted. The monosubstituted probes prov.de .nternal controls lor the t led 
arrly '£ SnSS factual hybridization from artifactual cross-hybridization. An examp.e of th* preferred subst.tut.on 

'^TJ^^^urstons may also be employed to ensure optima, dilation of perfect.* r hybridizing 
probes Forexample a detection block may be ti.ed to provide probes having opt.ma. ^^^^ 

10 minimal cross-hybridization. For example, where a sequence downstream from a polymorphs base s G-C rich, t could 
ZmiTve rise to a higher level of cross-hybridization or -noise," when analyzed. Accord.ng.y one , can tile Jhe 
de ec ion btock to take advantage of more of the upstream sequence. Such alternate t.l.ng conf.gurat.ons are sche- 
maSy ihu° tra ed in Figure 2B bottom, where the base in the probe that is complementary to the polymorphism is 
Sac d a" dSSnt positions in the sequence of the probe relate to the 3' end of the probe. For ease of d.scu^ 

is both the base which represents the po^morphism and the complementary base .n the probe are referred to here.n as 

^ SKS ^:Xo^:X^^ for any part.cular po^morph.sm ^^Z^sZ 
exaSrtriplet or larger detection blocks like those illustrated in Figure 2B may be read.ly employed to select such 

OP,i T;ays may beS for one or both strands o, the target sequence, i.e., the sequence including the Polymorphism 
TheSsion o» probes that hybridize to both the sense and antisense strands, either on a s.ng.e array or separate 
Irrays provides an additional .eve. of verification for a given interogation. Thus, in addition to probes hat •« compto- 
mTntary to one sdtrand of a target sequence, a detection block will a.so include probes ha are comP.eme'.tary to the 
aTsense strand of the target sequence, and which are therefor complementary to the first group of probes. 

Additionany, arrays wilfgenerally be ti.ed to provide for ease of reading and analysis. For examp.e, the probes ^ed 
wfth£ a detectton b.ock wilfgenerally be arranged so that reading across a detection block, the probes are t led ,n 
Tuccession i e , progressing along the target sequence one or more bases at a t.me (See, e.g., F.gure 3, m,ddte _ 

One Tan ar «y is appropriately ti.ed for a given polymorphism or set of polymorphisms, the target nuclei aad .s 
hvbrSed with the array and scanned. Hybridization and scanning are generally earned out by methods desenbed .n. 
e g Pub^hed PCT Application Nos. WO 92/10092 and WO 95/11995, and U.S. Patent No. 5.424,186, prev.ou^y 
Tncorporated herein by reference in their entirety for a., purposes In brief, a targe, nuc.eic ac.d sequence* M includes 
one or more prev.ous^ identified polymorphic markers is amplified by well known amplrf .cation techn.aues e g PCR 
Typica.rthis nvoK/es the use of primer sequences that are comp.ementary to the two strands of the target sequence 
flom upst earn and downstream from the polymorphism. Asymmetric PCR techniques may ^^"^JS 
an array is ti.ed for only a sense or antisense strand. Amplified target, generally incorporating a label, is *en hybnd.zed 
wtth he array under appropriate conditions. Incorporation of a label generally .nvolves .ncorporat.ng a labeled nueto- 
Se into meamplificaSon reaction, whereby the label is incorporated into the target nucle.c ^^^^ 
include fluorescent labels coupled to nucleotides, as well as well known b.nd.ng groups, e.g., b.ot.n, streptavid.n and 

the like to which a labelled complement may be later bound. 

Upon completion of hybridization and washing of the array, the array is scanned to determ.ne the pos*,on o the 
array to wteh the target sequence hybridizes. The hyridization data obtained from the scan, . a, .n the form of fluo- 
resce^ or some other detectable label or dye, is then plotted as a funct.on of locat.on on he array. 

A^hough pr imarily described in terms of a single detection b.ock, e.g., for detection of a s.ng.e P*m^« • « 
orefer ed aspects the arrays of the invention will .nclude multiple detection blocks, and thus be capable of analyzing 
m^rs^^irJmor^tams. For examp.e, preferred arrays w... genera.ly include from about 50 to about 4000 
Zl ent detec %ZSEZ?«* particularly preferred arrays including from 100 to 3000 different 

ralternate arrangements it will generally be understood that detection blocks may be grouped within a single 
a „l o?r™lp ™l™e arrays so that varying, optima, conditions may be used during the hybrid.zat.on of the 
fa qeUo the array For examp.e, it may often be desirable to provide for the detection of those polymorph.™ that fa. 
winin G-C rich^retches of a genomic sequence, separately from those failing in A-T rich segments. Th.s allows for 
the separate optimization of hybridization conditions for each situation. 

IV. Calling 

After hybridization and scanning, the hybridization data from the scanned array is then analyzed to^ ^^ti,y which 
variant or variants of the polymorphic marker are present in the sample, or target sequence, as determined from the 
D ?obes to w^ich the targethybridized, e.g., one of the two homozygote forms or the heterozygote form. Th.s determ.- 
n tion , teTmed -ca ^genotype. L.ing the genotype is typica.ly a matter of comparing the hybnd.zat.on dafa 



20 



25 



30 



35 



45 



SO 



55 



5 



BNSDOCID: <EP 0785280A2_L> 



EP 0 785 280 A2 



<,orhetero,ygo,es>,hat are present ^"' 'J^S^^i^p^ lor a W ».rlanl versus .hal 

„e. as bribed, MM Typical,, the ,ra ^ra o - b we« J ^ ^ h \ t „= zygo , c5 lyplc a.» 

second ratio -typically be MM, Xf^Jg^ZfiZZw* <h. seecilic sequence eurroondino 



described comparisons. PYPr ute the software ot the present invention which 

o, hoorozygote «ith , second ya™,„, ot a » ££££ Moos .1 as moose bot.oos 

As in Fig. 5, computer system 1 includes mon.tor 3 and ^^MOB J gda ter 10B , removable disk 112. 

ss i-i-i — --~rr.r " — " ■ 

LSS52=5ES5S£3SSS 

reSP A Sot h etica,exam^ 

TCAAG whereas another allele conta.ns the subsequence TC £*^™™™°* uXions at base positions wit hin two 
phism in each allele. Fig. 8 shows a probe array '-'^ base A (corn- 

base positions of the polymorphic marker. In the fust wo columns he .eel wn.cn , co p 
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lor carrying ool Ma analysis is raprM.d ir iWM" <W«*| * aeteImilw0 „ „ g Ihe wensity „, ma oolla at 

P, ^r^r^TX^^ ^ -.p. , sVo ,0,00, maron a.a., 9 o lo, 

"u,.^ ; 1 ': 

involves testing whether the maximum of the perfect match averages for x or y is at '^st twice .as ore 

results and this is communicated to the U8 « r . aooroaching - near 0, or approaching In practice, 

rrSU«»a.^por^* m a, k a,0«,in 9l o 
" - A *o.nnsp.o,K»i o, ma ft* action m slap »4 M £ S^STS^ 

in a 0 when the logarithm of the Ratio is calculated. invention These equations have 
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intensities, including, e.g., principal component analys is and the , modifications to the 
Those ot skill in the art, upon reading the instant ^ure w J ^'^VspW oTscope of the invention. For 
above described methods and arrays may be made wrtho ^^^^^aSrScular genome. Alterna- 
example, one may select the strand of the targe ^^^^^7^ r^which a call can be made. 

A a°; a ^ - d - r - mixed po,ymers and 

The present invention is further H.ustrated by the following examples. These examp.es are merely to illustrate 
aspec^s ome present invention and are not intended as .imitations of this ,nvent,on. 



V. Examples 
Example 1- Chip Tiling 



ADNA ch 1 pispreparedwhichcon = 
or biallelic markers, in a segment of human DNA (the mrgei J complementary to the 7th, 10th 

cleotide array containing all 78 detection blocks is shown .n ^Rgure , 2A. ^ 
The tiling strategy for each block substitutes bases ,n ^^^^^^^ ,„ pairs differing at the 
the po.ymorphism. In addition to the substituted P rabes > a,,OWS ,0r C ° n " 

biallelic base. Thus the ^he d^ instrumentation. 

TSZX^ZSE: « wT Refection block is shown in Figure 3, for a specific poly- 
s morphic marker denoted WI-567. 

FYam ple 2- Detection of Poly morphisms 

taH trnm prR Drodu cts amplified by primers flanking the markers. These amplicons 
A target nucleic acid is generated from PCR P roa . uct * a ™P""; nr ll. ced as SS . DNA by asymmetric PCR from one 
can be produced singfy or in mu.tip.exed reasons -J^^LSJJT^!^ promoted .inked to the primers, 
primer flanking the polymorphs, as ds-DNA, or as RNA "f"^"^ " n-bearinq nucleotides. Biotin labelled target 
Fluorescent or biotin label is introduced into target ^jf^^^^^^^ containing nucleotides. In 
is then bound after amplification using 33££ S^Sd of the primer. 

DNA produced by symetric or asymetnc PCR ^T^^^S^^S^ are carried out with standard 
Hybridization of target to the arrays tiled ,n Ex amp e 1 ^ «JJ" ^ ^ 9 ^ te solvent (forma- 
so.utions of sa.t (SSPE, TMAC1) and nomonic ^ e ^^ (37-40°C; 
mide). Targets and markers generating strong signals are washed under s nng |e J jvj |ower 

10% formamide; 0.25xSSPE washes) to give highly f^^^^^cf^S'spE; 6x and fx SSPE 
hybridization intensity are washed under less stringent cond.t.ons (<30 C. 3M TMAoi 

washes) to yield highly discriminating d * ec ^*^™°W° e 3 Sp ecificalfy, a typical detection block is shown for 

Detection of one polymorphs marker is '""f ated , " F 'f '?.^ 
the polymorphism denoted WI-1959, hav.ng the sequence ^AAAMT^ 
w*h the polymorphism indicated homozygote and heter- 

erozygote, C/C homozygote and m ^^^ t ^^ VIBte tj ,e d with each chip including the illustrated detection 

a sufficiently low background intensity for ad ^ate detect hybridized to the de- 
F.uorouraci. ^ |^ X ^3?^ , S^ , S«SS lucSn temperature. As shown in the scan 
tection array m SxSSPE + Trrton-X100 at 30 C, ana homozygote or 10 heterozygote features. 

Figure 3, midd.e, f.uorescent scans of the ^^^^X^Z oi Carity and understanding, it wi.l 
While the foregoing invention has been described I in some deta to W » can fae made 

be clear to one skilled in the ad from a reading of t ^isdosu J^^pSSS^l cited in this app.ication 

as " each indiv,dual pub,ication or 

patent document were so individually denoted. 
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BEGIN ( 
rawa-cutcf f » 1 - 
pat toggle - •yes* 
base(01=-r- 
basa[11«*3" 
base[2]="2' 
bas*[3>*A" 
aame[0.o: = •Wl-ai-' 
hex [CO] = *CTA«iCC * 
name [1,3] = m °*l'*' 
hextl.O] = -TCAvi^.' 
name[2,0] - -W2 
hex(2.01 = * 
name[3.0] = -Vil--:*.' 
hex [3,0] = -XArTAA" 
name [ 4,0] = 
hex £4,01 - " CTr~- % ' ' 
nane(5.C] « -wi-»::' 
hexCS.C] - 
name [ 5 . C ' * " W I - . 
iiexlo.Cl - ' 2A^* 
r.ame[7.0; = "Wr-i:* " " 

hex i 7 . o; * -ac^a^:* 

name [8.0] - •v:-ii--' 
hex [ e . 0 1 - •TTCTA-" 
name 19. 01 - "WX-i*- 
hex[9.0) * • TTCTTT • 
name[0.1] * •Wi-l^f.' 
hexiO.U = 'AAA — r 
namall.ll - «w:-H~ 
hextl.il » •GTrrrr* 
n*m*[2. 1 ] » "WI- " '* 

hext2.ll * -TA.r-- 
name[3.i: « 
hext3.ll * -ATG*C\- 
name [4.-1] - -w: - l v 
hex [4.1] » -TTrrrr* 
camelS,!] « -wt P*' 4 
hex[5,l] = TTCr-V 
name[6.11 - "WW"*-" 
hex[6.-l - * 
name: 7 ( 1] = -Wl-1 j* 
hexfMl = -ACCACA- 
name(3,l] * -Wi-l-*'- 
hex [8.1] = -TCCJAVA- 
name [9,11 = -WX-i>-J' 
hex[9.1] = -CAACiA-" 
name[0,21 * •WI-lii- < 
hex[G-. 2". = "AAC7GA' 
name [1.2] = •wi-2011* 
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hex[l,2] - " GACTGT" 
name[2.2] = -WX-2564' 
hexl2.21 = " GGAGAG * 
5 nameC3,2] = -WI-4013' 

hex[3,2] = "CTAGTG" 
naae(4.2] = "WI-7567- 
hex [4, 2] = -TAGTGA- 
name[5.2] = -WI-11595' 
hex(5.2] = " TAG AG C - 
10 name [6. 2] = -CM4.16' 

hex [6, 2] = " GATAAT" 
name;?, 2] * -WI-6704 
hex [7, 2] = -ACTCCA- 
r.ame[3.2] = -WX-6721" 
hex[3,2] = -GGCACA- 
75 r.am*£9.2] = "WI-6787' 

hex(9,2] = * ACAGTT " 
name CO. 31 = -WI-5310" 
hexlO.33 - -TAGTTG- 
na»e[1.31 = -WI-9513" 
20 hex[1.3] = -TTGATT- 

name [2.3] = " ADH3 " 
hex(2.3] = "ATAGTT- 
name (3. 3] = " AGT" 
hex [3,3] = -GACTGG- 
name(4,2] = -ALBOB-1" 
25 hex{4,3] - ' TTCTGG * 

name[5.3l - -ALD03-2" 
hex[5.3] = -CCAGAT- 
name[6.3] = "A?CB* 
hex [6, 3 3 = -ACTCCT- 
nameC7.3] = "APOE (152T/C) • 
hex[7,3] = ' TGTCGC " 
naneCa.3] = -APCE (290T/C )« 
hex [3, 3] = * AGTCGC " 
name [9. 3] = " ARSB " 
hex 19,3] » -TCGATG- 
35 r.ame[0,4] = • ATla ' 

hex 10. 4] = -CTTCCC- 
name[1.4] = "ATlb" 
hextl.4] - -GCAcrr- 
name[2.41 = "BCL2 * 
hex(2.4] = -ACGAGG- 
40 naweC3.4] = "BRCAla " 

hex[3.4] - -CATCTG- 
name[4 < 4] = "BRCAlb* 
hex{4,4] = -AGAGAG- 
name[5.4] = -BRCAlC 
5 hex 1 5, 4] = -GAAGAG- 

name[6,4] = "D3S2' 
hex[6.4) = • CCAGGT " 
r.ame[7.4] = -D3S11* 
hex [7. 4] = "TCTGRR" 
nameC8.4] = "D3S12- 
SO hex[8,4] = " CCAGGG " 

name 19. 4] = " DRD2 " 
hex(9,4l = -CACTGG- 
name [0. 5] - •FABP2 " 
hexlO.5] * • GCGACT " 

55 
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name[l,5] = " GCK" 
hex(i,51 = " GAG ACA " 
name [2. 51 = "HT2" 
_ hex(2,51 = " CTGTGG " 

name[3,51 = "HT4" 
hex[3,51 = "TGCAAT" 
name[4,5] = ' HT5 * 
hex 14. 5} = " ACTCGA " 
name[5.5] = -IGF2" 
10 hex [5, 5] = " GGGACC " 

name[6.51 = "IGHV4-5- 
hex [6, 51 = -TCTCGA- 
name[7,5] = " INS " 
hex 17, 5 3 = -TCTACC- 
name[8,Sl = ' LDLH " 
15 hex[8.5l = " GGCT AA " 

name[9.Sl - -LP79" 
hex [9, 51 = -CCAGGG- 
nameCO.61 = *L?L" 
hex[0,51 = " AGCTAG * 
name [1.61 = " MCC " 
hexCl.Sl = " GCCTGA" 
n ame[2, 6] = "METH " 
hex '2.51 = "CCCTGG- 
name(3.61 = "NRAMP" 
hex[3,51 = -CAGATG- 
25 name[4,6] = - PA^' 

hex [4. 5] = " ACATTG " 
name[5.5] = "Per/RCS- 
hex{5,61 = -GAAGGA- 
name(6.51 = -PPP3R1" 
hex[S,6] = ' GACTAA " 
30 na.meC7.61 = "RDS " 

hex[7,6] = -AGGACG' 
name[B,61 - 'sl4544- 
hex[8,61 = -TCTGCT- 
name[9.6] - 'S180A- 
35 hex [9. 61 = * GGC ATG " 

name [0.7] * -TcR-CAl* 
hex [0.7] = -TGCGGT- 
name [1,71 = -TcR-CB22' 
hex(l,7l = " GGCTGG " 
name [2, 7] = •TCR-CB23 " 
40 hex[2.7l = " CTCTAG * 

name (3, 7] » -TcR-CB24" 
hex[3,71 = -GTGATG- 
name[4,7] = -TCR-CB25- 
hext4.7 3 = -GTAGCC- 
name[5.7l - -TcR-CB27« 
hex[5,7] » -ACCTTA- 
name[6,7] = -VBl2a" 
hex[6.71 = -ACAGTG- 
■ name[7,7] = -VBl2b" 
hex[7.71 = -CACTCA- 
50 bkgsiim = 0 

bkgnum = 0 

} 
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( 

readthis - 1 

if (SI - / IA--.*-t I ' t ■ CI - .' fA-'a-z]/) readthis = 0 
5 if {readthis »- U ; a»U:a ' SI. $2] = 33 

if ($i>2 &i $:»*) if .;:<::2 $2<i24) if <$i<90 || $2<io9) 
{ 

px =■ in- : ; 51 - J ' ID 
py = mt : c si -si ~» 
pxo -- ti:*p*i o 
10 pyo « '15*pv 

mx - 51 -f-x^ 
by * SI-FV" 

fclock • 3 # ;.r.t .fc/ EJ » -7 
if tfay* 4 * • « 4 soc ■» 10) 

* 5 %t - — :ty*51 

n; p* . r lork. sb.mx] = $3 

) 

ii ihy%* 4 .. aix 1C) 

t 



20 



35 



40 



45 



SO 



55 



) 



) 

end: 

25 printf 



. , v , . . . %* n - L*ss-im/b)cgriu*T\> 

pr.ncf -K~>-*F c *<10;px~] if (py < 7 | 1 px < 8) 
{ 

r.;c) t s.irs--r r- * * 

m ;i} . av^a'.r \* - . • " • * ' 

30 m{7] « sur « t r • r * : ' : • ■ 

ml 21 • su^i'r *-* ■ »* * •••*'' 

m(4 ] s * ••.*.. ' * 

m^] - euz*tr ;» :\ * - J 

ml 61 - suT*t: . • - 11 



n[9: - » .»-■■ 

center - ~. i IJ • / "suhstr • hexlpx,py] ,4,1) 

ocntajre- . i : *» . * ' "»[61 81 

header I *: ' ria^lpx.pyl ■ \n- pentamer -\a- 

head;r::.* - - 
I 

tor , • * * - * ' * 



r« pv- block, f] = o 

: . . j«-J 3^4;g*+) maxlolpx. py. block. g.fl 
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10 



15 



25 



30 



40 



for <k=C;k<=9;k+-) for ;b=0 ; b<=3 ;b+* : 
( 

= = iat(k/2) 

Signal = sig{?x. py, block, base [b] , kj 
omit - 0 

if (mtk] - basetb] ) emit = 1 
if (omit == 1) 

q = maxhi (px.py. block. =1 

if (signal > q) maxhi [px. py. block, z =signa- 



if (omit ■« O 

q = maxlo [px.py. block, b, =1 

if (signal > q) maxlo [px. py, block. b, =] -s-snal 
if <k%2 0) 
i 

num2 «■= signal 
x2 + « (signal) "2 
nl>^ 

20 ) 

if (k%2 == li 
{ 

der.2 + = signal 
x2 + « (signal) "2 
n2 *•■*• 

} 

if (omit 1) if (k==4 |i k==5) 
{ 

if (base(b) subs:r (hextpx. pyJ , 3 . 1) ) 
{ 

muni -= signal 
> 

if (baseCb] == substr (hex[px.pyj , 4 , D ) 
{ 

den 1 += signal 

35 . } 

> 

} 

maxhisum = 0 
for Cf=0;f<5;f++) 
{ 

maxhisum maxhi [px, py. block, f) 

} 

maxhiav = maxhisum/ 5 
maxlosum = 0 

for (g=0;g<5;g++) for (v=0 ; V<4 ; > 
45 ' 

maxlosum *= maxlo Ipx. py, block. v. g] 

) 

maxlo av = maxlosum/ 14 
maxrat = maxhiav/maxloav 
num = ( inuml/2) - (num2/nl) ) 
SO if (num < 0) num - 0 

den = ( (denl/2) - (den2/n2) ) 
if (den <= 0) den » C.001 
ratio = nua/den 
max = numl / 2 

55 
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if (denl/2 > max) max = denl/2 
n = nl+n2 

scdvxnum = ( (n*x2) - {num2+deii2 ) ~2 ) 
if (stdvxrmm < 0) sccivx = 0 
stdvx = (stdvxnum/ (n"2 ) ) ~ (0 . 5 ; 
if (maxrat > ratpatcutoff | I pattoggle = ; 
{ 

if (headprint == 0) 



printf •\t20/-bloc3c-\t- 
priatz C%i.2f\C", racio) 
if (ratio < 10000) printf "\t" 
rat - ratio 

if (ratio == 0) rac = .00001 
lograt = leg ( rat) /log (10 ) 
printf f%2.2f\t", 10-lograt) 
printf ("%2.2f, max/stdvx) 
if (max/stdvx < 2) printf " \ tFAlI/ 
if (max/stdvx >^ 2) printf "\t\C 
printf C%2.2f", maxrat) 



if (maxrat > ratpatcutof f ) printf -\fCOOOPAT- 



10 



printf header 
headprint = 1 



25 



printf "\n" 



> 



) 



30 



35 



40 



45 



50 



55 
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Claims 



said sequence that is complementary to said polymorphs, wherein n ,s from 0 to 5. 



2. The array of claim 1 , wherein n is 2. 



The array of Cairn 1 , wherein said first and second groups of probes comprise a p.ura.ity of different probes that 
^ complementary to overlapping portions of said target nucleic acd sequence. 

4 The array of Cairn 1 , wherein said detection block further comprises said third and fourth groups of probes wherein 
laid m^osubswutions occur at a plurality of distances from a ar end of sa.d probes. 

5 The array of Cairn 1 , wherein said detection block includes between about 8 and 88 different probes, 
sequence. 

8 A method of identifying whether a target nucleioacid sequence includes a po.ymorphic variant, comprising: 

to identify said polymorphic variant. 

9. The method of claim 8, wherein said target nucleic acid comprises a detectable label. 

10. The method of claim 9, wherein said detectable label is a fluorescent group. 

11. The method of claim 9, wherein said label is a binding group. 

12 The method of claim 11 , wherein said binding group is selected from biotin, avidin and streptavidin. 

14. The method of claim 8, wherein said step of determining comprises: 

casing a ratio of hybridization intensity of said target nucleic acd to > ^ first group of probes versus 
hybridization intensity of said target nucleic acid to said """^ 2 a homozygote to r said 
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FIG. 2A 
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SEQUENCE AT POLYMORPHISM WI-567 



5 ' 

TGCTGCCTTGGTTC [h AGCCCTCATCTCTTT 



THREE BLOCKS OF COMPLEMENTARY 

IW^^ 20-MER OLIGOS WITH SUBSTITUTIONS (N) 
ipmjW 7 < 10 AND 13 BP FR0MTHE 3 ' END 



p-2 P-l P P+l P+2 
A G A G A G A G A G 



m 



BASES IN THE SHADED COLUMNS 



3 • BLOCK 20/7 5 ' 

AACCAAN [ C ] TCGGGAGTAGAG 



m 



3' BLOCK 20/10 5' 
CGG AACCAAN [ C ] TCGGGAGTA 



m 



3' BLOCK 20/13 5' 

CGACGGAACCAAN [ C ] TCGGGA 



FIG. 2B 
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SEQUENCE AT POLYMORPHISM WI-567 



5 ' 



TGCTGCCTTGGTTC [J] AGCCCTCATCTCTTT 



20 MER OUW frqm JHE 3 . END 



-ACGGAACCANG [T J TCGGGAGT 
,-, r rnr^n.nnn.r,T 



ACGGAACCANG [C ] TCGGGAGT 

CGGAACCAAN [T] TCGGGAGT A 

CGGAACCAAN [ C ] TCGGGAGTA 

GGAACCAAG [■] TCGGGAGTAG 

GGAACCAAG [N] TCGGGAGTAG 

GAACCAAG [ T ] NCGGGAGTAGA 

• GAACCAAG [ C ] NCGGGAGTAGA 

AACCAAG [ T ] TNGGGAGTAGAG 

— AACCAAG [ C ] TNGGGAGTAGAG 



It 



P-2 P-l P P+l P +2 



AGAGAGAGAG 











































































PREDICTED PATTERNS 

GAG AGAGAGAGAG AGAGAGAGAG 



T 
G 
C 
A 



GENOTYPE: 



Kffl ■ i™ 

A/A G/G 



FIG. 3 
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P-2 P-l ?0 P+l P+2 
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FIG. 7 

T C AG 
AG AG AG AG AG 




FIG. 8 



(" START ) 



RECEIVE CELL 
INTENSITIES 



1 



CALCULATE RATIO = 
PM(x) - MM(x) 
PM(y) - MM(y) 



CALCULATE DB = 
10 * log-io RATIO 



Y 



202 



-204 



PERFORM STATISTICAL 
CHECK ON DATA 



-208 




210 



1 



212- 



HOMOZYGOTE 
WITH FIRST 
MARKER 



NEAR 0 



HETEROZYGOTE 



HOMOZYGOTE 
WITH SECOND 
MARKER 



-216 



-214 



( END ) 
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(54) Polymorphism detection 



(57) The present invention generally provides a rap- 
id efficient method for analyzing polymorphic or bialletic 
markers, and arrays for carrying out these analyses. In 
general, the methods of the present invention employ 
arrays of oligonucleotide probes that are complementa- 
ry to target nucleic acids which correspond to the marker 



sequences of an individual. The probes are typically ar- 
ranged in detection blocks, each block being capable of 
discriminating the three genotypes for a given marker, 
e g., the heterozygote or either of the two homozygotes. 
The method allows for rapid, automatable analysis of 
genetic linkage to even complex polygenic traits. 
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